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Abstract. We show that predicting the HPP or FHP III lattice gas for 
finite time is equivalent to calculating the output of an arbitrary Boolean 
circuit, and is therefore F -complete: that is, it is just as hard as any other 
problem solvable by a serial computer in polynomial time. 
It is widely believed in computer science that there are inherently sequen- 
tial problems, for which parallel processing gives no significant speedup. 
Unless this is false, it is impossible even with highly parallel processing 
to predict lattice gases much faster than by explicit simulation. More 
precisely, we cannot predict t time-steps of a lattice gas in parallel com- 
putation time 0{log'' t) for any k, or 0{t°') for ol < 1/2, unless the class 
P is equal to the class NC or SP respectively. 

1 Introduction 

Given the initial conditions of a d-dimensional cellular automaton, suppose we 
want to know the state at a site t time-steps in the future. We can do this 
in 0{t'^'^^) steps on a serial computer, or 0{t) steps on a parallel one, simply 
by simulating the CA explicitly and filling in the light-cone above the site in 
question. Thus CA Prediction is in the class P of problems solvable by a 
serial computer (such as a deterministic Turing machine) in polynomial time. 

For large t, it would be very nice if a short-cut were available; if instead of 
having to simulate every step of the CA, we could skip over a lot of it and get 
the answer in polylogarithmic time, i.e. C'(log'^ t) steps for some fc, on a parallel 
computer with a polynomial number of processors. This would put CA Predic- 
tion in the class NC C P of problems solvable by circuits of polylogarithmic 
depth and polynomial size. 

A problem is complete for a complexity class if all other problems in that 
class can be reduced to it; thus if it could be solved quickly, so could all other 
problems in its class. Just as NP-complete problems are believed to require 
a super-polynomial amount of search, P-complete problems are believed to be 
inherently sequential, so that the work needs to be done in step-by-step order 
and cannot be efficiently parallelized. Unless NC — P (which would be almost 
as surprising to computer scientists as if P = NP), then, P-complete problems 
cannot be solved in polylogarithmic parallel time. 



Some non-linear OA's with certain algebraic properties can be predicted in 
NC [9, 10], as can the Lorentz lattice gas of one particle bouncing off of fixed 
scatterers [13]. But CA Prediction is P-complete in general since CA's exist 
that can perform universal computation [5, 7]. Other cellular automata and 
lattice systems that have been shown to be P-complete include majority- voting 
CA's, single spin-flip Ising dynamics, sandpilcs, fluid invasion, and diffusion- 
limited aggregation [8, 11, 12]. 

These systems are P-complete because the following problem, called Circuit 
Value, can be reduced to them: given a Boolean circuit, i.e. a directed graph 
whose nodes are AND, OR and NOT gates, and given the truth values of its 
inputs, is its output true or false? This is P-complete since any deterministic 
Turing machine computation of length t can be converted to a Boolean circuit 
of depth 0{t): it seems inherently sequential, since it is hard to imagine how 
one could calculate the output of an arbitrary circuit without going through it 
level-by-level. 

Then the proofs in [8, 11, 12] work by showing that we can build "wires" to 
carry truth values, and AND and OR gates to connect them, so that the future 
state of a particular site corresponds to the output of the circuit. 

In this paper, wo show that the same can be done for lattice gases in both the 
HPP and FHP III rules, so that Lattice Gas Prediction is P-complete. Thus, 
unless NC = P, lattice gases in two or more dimensions cannot be predicted in 
polylogarithmic parallel time. 

We can draw the finer distinction of whether a polynomial speedup is possible 
in parallel, i.e. whether t time-steps can be predicted in 0{t°') parallel time for 
some a < I. The class of problems for which this is possible is called SP for semi- 
efficient parallel computation [17]. Using the theory of strict P -completeness [1], 
we show that a > 1/2 for lattice gases unless SP = P. We conjecture that this 
lower bound can be improved to q: = 1, in which case it is impossible to predict 
lattice gases faster than by explicit simulation. 

Finally, we end the paper with a discussion of the relevance, or lack of it, of the 
computational complexity of lattice gases to that of continuum hydrodynamics 
and the Navier-Stokes equations. 

We recommend [5, 15] for introductions to NC, P, and P-completeness. 

2 The HPP lattice gas 

The simplest lattice gas is the HPP rule on the square lattice [6] . Unfortunately, 
its continuum limit is non- isotropic; it has an infinite number of unphysical 
conserved quantities, namely the total horizontal (vertical) component of mo- 
mentum in each row (column). Thus it is not a good rule for simulating hydro- 
dynamics. However, it has the basic characteristics of a lattice gas, so we will 
address it first. 

The rule is shown in figure 1. There is only one kind of collision where in- 
teraction takes place, namely when two particles collide head-on; the outgoing 
particles are then rotated by 90°. 



Fig. 1. Two-particle collisions in the HPP lattice gas. Particles colliding head-on are 
rotated 90°, while particles colliding at 90° don't interact. 



Our proof is essentially graphical. Our truth values will be carried by upward- 
moving particles or "wires," which are present if true and absent if false. Then 

we rely on the following facts: 

1. ) Wires can be shifted or delayed. 

2. ) Wires can be split into two carrying the same truth value. 

3. ) Two synchronized wires (with the same y-coordinate) can be combined 
by an AND gate. 

4. ) Wires can be negated. 

5. ) The gates and wires can be arranged in such a way that the "exhaust" 
particles created by collisions do not interact with the wires. 

(1), (2) and (3) are shown in figure 2. The AND gate, for instance, works 
as follows: particles a and 6, if they are present, collide with downward-moving 
particles (shown dashed) to produce sideways-moving ones. If both a and b are 
present, these sideways- moving particles collide to produce vertical particles; we 
take the upward-moving one to be a A b. 

(4) is shown in figure 3. Here, two vertical particles (dashed) collide if a is 
not present, and their output collides with a horizontal particle, producing a 
vertical particle a. If a is present, it collides with the upper dashed particle first, 
preventing this from happening, and a is not produced. 

Finally, we can show (5) as follows. Horizontal exhaust particles don't interact 
with vertical wires because 90° collisions are non-interacting, and as long as no 
two gates have the same y-coordinate, no two horizontal particles will collide 
to create vertical exhaust. To prevent vertical exhaust particles from colliding 
with the wires or with each other, we make sure that the gates have different 
a;-coordinates from each other and from the wires. 

Then we can implement any Boolean circuit, first by converting its gates to 
AND gates and negations (using De Morgan's law, aV6 = o A 6) and then taking 
one AND gate at a time, delaying one wire so that the two are synchronized, and 
combining them as in figure 2. If the number of gates is n, clearly the number of 
particles required is linear in n; if we ensure (5) by giving each gate a different 
a;-coordinate, particles may have to travel a horizontal distance of 0{n) to meet. 
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Fig. 2. Shifting, splitting, and delaying wires, and an AND gate, in the HPP lattice 
gas. Dashed arrows arc part of the gate, or "exhaust" particles that may be produced; 
solid arrows are particles that carry truth values. 




Fig. 3. Negation in the HPP lattice gas. If a is not present then the two vertical dashed 

particles collide, producing a particle which collides with another approaching from the 
right, and a is produced; if a is present (solid) it collides with the upper dashed particle, 
their output misses the left-moving particle, and a is not produced. A dotted arrow 
indicates a particle that isn't there. 



so the total time is at most O(n^). 

In fact, we can improve this time bound by stratifying the circuit into levels 
and doing each level in parallel. Let the inputs be level 0, and let level fc + 1 
consist of those gates whose inputs are at level k or less. Then the depth D of 
the circuit is the level of the outputs, and its width W is the maximum number 
of gates at any one level. 

We can do the AND gates and negations at a given level almost all at once, 
separating them by one site in the ^/-direction as shown in figure 4 so that 
sideways-moving particles don't collide with each other. Particles travel a hori- 
zontal distance 0{W) to meet in their AND gates, after being synchronized by 
delays of 0{W), and then move horizontally by 0{W) so that the next level of 
gates will not interfere with this one. Thus the time and spatial extent of the 
CA wc need is 0( W) for each level; summing over all the levels gives a time and 
length 0{DW) = 0{n). 
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Fig. 4. Doing each level of the circuit at almost the same time, and shifting the wires 
to avoid interference with the next level. Delays are not shown. 

Since the time and length for the lattice gas are linear in the number of gates, 
a Boolean circuit of polynomial size can be reduced to predicting a lattice gas 
for a polynomial amount of time and space. Thus we have shown that HPP 
Lattice Gas Prediction is P-complete.^ 

3 The FHP III lattice gas 

The FHP lattice gas lives on the triangular lattice. Unlike the HPP rule, it 

becomes isotropic in the continuum limit, making it an attractive way to sim- 
ulate the Navier-Stokes equations. We use a variant called FHP HI [4], which 
includes both stationary and moving particles; this allows particles to have a 

varying speed or energy, so that the system has non-trivial thermodynamic be- 
havior. Collisions are shown in figure 5, except for head-on collisions, which 

^ Strictly speaking, for Lattice Gas Prediction to be in P we need to express t in 
unary, so that even for initial conditions with very few particles the size of the input 
is t rather than O(logi). 



non-deterministically rotate the outgoing particles by ±60°. We will not use 
head-on collisions in our simulation. 
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Fig. 5. Two-particle collisions in the FHP III lattice gas. Particles colliding at 120° 
produce a stationary particle and a moving one, and vice versa (conserving momentum 
but not energy), and particles colliding at 60° don't interact. 

Then wc can meet the same 5 criteria as before: 

(1), (2) and (3) are shown in figure 6, where we use stationary particles to 
bend or split wires, and a 120° collision for AND. 

(4) is shown in figure 7. Again, if a is not present then a dashed particle sets 
off two collisions and produces a, while if a is present it collides with the first 
particle and prevents this from taking place. 

(5) is slightly harder here than it was for the HPP rule, since diagonal and 
vertical exhaust can interact. Again, we stratify the circuit into D levels, with 
at most W gates in each. Then note the following facts: 

a. ) Upward-diagonal particles only interact with each other if they are at 
the same y coordinate, i.e. if the displacement between them is horizontal, and 
similarly for downward-diagonal particles. 

b. ) Upward-diagonal and downward particles only interact if the displacement 
between them is 60° from the horizontal, and similarly for downward-diagonal 
and upward particles. 

c. ) Upward-diagonal and downward-diagonal particles only interact (through 
head-on collisions) if their displacement is 30° from the horizontal. 

As shown in figure 8, all of these things can be avoided if we arrange gates 
in each level so that they are displaced from each other by less than 30°, but 
more than 0°, from the horizontal. This is easily accomplished within a rectangle 
whose height and width are linear in W . 

Thus we can simulate circuits level-by-level as we did for the HPP lattice 
gas, using 0{n) time and length for circuits with n gates. So FHP III Lattice 
Gas Prediction is P-complete also. 

We note that a similar result for the FHP HI rule was obtained in [16] under 
the name of "computation universality," but with additional barrier particles of 




Fig. 6. Shifting and splitting wires, and an AND gate, in the FHP III lattice gas. 
Solid dots indicate stationary particles that are initially present; dashed ones may be 
produced as exhaust. 
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Fig. 7. Negation in the FHP III lattice gas. 



infinite mass that other particles bounce off of elastically. 



4 Strict P-completeness and polynomial speedup 

Since Lattice Gas Prediction is P-complete, lattice gases cannot be pre- 
dicted in polylogarithmic parallel time, unless NC = P and all problems in P 
can be parallelized to this extent. But even if NC C P, it might still be possi- 
ble to achieve a polynomial speedup with parallelization, predicting t time-steps 
in parallel time 0{t°') for some a < 1. Such problems are in the class SP of 
semi-efficient parallel computation, introduced in [17]. 



:*: 



Fig. 8. Collisions between exhaust particles only take place if they are displaced by 0°, 
30° or 60° from the horizontal. We can avoid this by arranging gates in each level so 
that they have a small (less than 30° but more than 0°) displacement in the j/-direction. 



Clearly even a polynomial speedup is highly useful from a computational 
point of view; t^l"^ is an enormous improvement over t for large t. For instance, 
Moriarty, Machta and Greenlaw give an algorithm with a < 1 on average for 
diffusion-limited aggregation [14]. They call a the dynamic exponent, and pro- 
pose it as a measure of the extent to which a system's dynamics is dependent 
on its history.^ 

Is SP = P? That is, can all problems in P be parallelized with a < 1? To 
study this, Condon introduced the notion of strict P-complctcncss [1]. Although 
the precise definition is somewhat technical, roughly speaking a problem U is at 
most T(n)-P- complete if, for any other problem 77' S P that can be solved in 
serial time t'{n), there is an NC reduction / from 77' to 77 such that r(|/(n)|) = 
0(t'{n)) (plus a possible factor polylogarithmic in n). Then if we can solve 77 
polynomially faster than T(n) in parallel, then we can solve 77' polynomially 
faster than i'(n), and every problem in P has a polynomial speedup. If 77 in fact 
can be solved in parallel time 0(T(n)), then it is strict T{n)-P- complete. 

Condon's main result is that CIRCUIT Value is strict n^/^-P-complete for 
square circuits, whose width is equal to their depth; that is, that unless SP = P, 
the number of parallel steps needed is proportional to the circuit's depth D = 
W = nV2. 

Our reduction from circuits to lattice gases can be done in NC since it 
consists of a local replacement of gates with groups of collisions. Furthermore, 
the time, size, and number of particles in the CA configuration it gives are all 



* Moriarty and Machta define a in terms of the system's size; using time seems more 
appropriate for a CA. 



linear in n, so the input size is |/(n)| = O(nlogn) if we list each particle's 
initial position in a square of size 0{n). Thus the reduction from square circuits 
to lattice gas states is linear (plus a logarithmic factor), and Lattice Gas 
Prediction is at most f^/^-P-complete. We can say then that lattice gases 
have a dynamic exponent a of at least 1/2 unless SP = P. 

In fact, we conjecture that lattice gases are strict f-P-complete and have a 
dynamic exponent of a = 1, i.e. that they cannot be parallelized to better than 
linear time, so that explicit simulation is the best we can do. This is because the 
degree to which we can parallelize the system seems bounded by the mean free 
path of the particles, which is a function of density but stays roughly constant 
in time. 

One way to prove this would be to show that, for each a < 1, the family of 
circuits with D = n" is strict n"-P-complcte; then CIRCUIT VALUE would be 
strict n-P-complete in general. Unfortunately, the proof in [1] doesn't appear to 
work for circuits narrower than a = 1/2. 

5 Is all this physically relevant? 

We have shown that Lattice Gas Prediction for the HPP and FHP III rules 
is P-complete, and in fact at most t^/^-P-complete. Thus it cannot be solved in 
parallel in polylogarithmic time, or even for a < 1/2, imlcss all problems in P 
are easier to solve than we think; in other words, explicit simulation is probably 
the only way to predict a lattice gas. 

However, this begs the following question: what relevance, if any, do these 
results have to the complexity of the Navier- Stokes equation or continuum hydro- 
dynamics? In the continuum limit we are interested in averages, not the presence 
or absence of a single particle at a single site. Our intuition about turbulence 
is that, as a highly non-linear system, it must be integrated numerically, and 
no short-cuts are possible. But we have not proved that here, as the following 
discussion will show. 

If a fast algorithm existed for the lattice gas CA, then we could use it to 
predict the Navier-Stokes equation quickly: just convert the continuum state 
into a CA state, predict the CA, and average to get the continuum state back. 
Thus the complexity of a discrete system is an upper bound on the complexity 
of its continuum limit (assuming that choosing a CA state and taking averages 
can be done quickly). 

However, the converse is not the case. The heat equation, for instance, is 
linear and easily predictable, even though it is the continuum limit of the com- 
putationally universal "billiard-ball" CA of Fredkin and Toffoli [2]. Thus the 
continuum limit of a CA can be much easier to predict than the CA itself, and 
a hardness result for the CA is not necessarily a hardness result for the corre- 
sponding continuum system. 

It would be extremely valuable if a continuous analog of P-completeness, 
showing that numerical integration is the only way to predict the system, could 
be proved for a naturally occuring partial differential equation. We expect that 



this will require radically different techniques than for the discrete case. In the 
meantime, we hope that similar complexity results — either P-completeness 
proofs or efficient prediction algorithms — can be found for other cellular au- 
tomata of physical interest. 
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